AITopics

2506.18396

Country: Europe > Italy > Campania (0.14)

Genre: Research Report (0.40)

Industry:

Health & Medicine > Therapeutic Area > Oncology > Leukemia (1.00)
Health & Medicine > Therapeutic Area > Hematology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

arXiv.org Artificial IntelligenceJan-6-2025

LHGNN: Local-Higher Order Graph Neural Networks For Audio Classification and Tagging

Singh, Shubhr, Benetos, Emmanouil, Phan, Huy, Stowell, Dan

Transformers have set new benchmarks in audio processing tasks, leveraging self-attention mechanisms to capture complex patterns and dependencies within audio data. However, their focus on pairwise interactions limits their ability to process the higher-order relations essential for identifying distinct audio objects. To address this limitation, this work introduces the Local- Higher Order Graph Neural Network (LHGNN), a graph based model that enhances feature understanding by integrating local neighbourhood information with higher-order data from Fuzzy C-Means clusters, thereby capturing a broader spectrum of audio relationships. Evaluation of the model on three publicly available audio datasets shows that it outperforms Transformer-based models across all benchmarks while operating with substantially fewer parameters. Moreover, LHGNN demonstrates a distinct advantage in scenarios lacking ImageNet pretraining, establishing its effectiveness and efficiency in environments where extensive pretraining data is unavailable.

artificial intelligence, imagenet, machine learning, (12 more...)

2501.03464

Country: Europe (0.46)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Abdullah, Abdulhady Abas, Ahmed, Aram Mahmood, Rashid, Tarik, Veisi, Hadi, Rassul, Yassin Hussein, Hassan, Bryar, Fattah, Polla, Ali, Sabat Abdulhameed, Shamsaldin, Ahmed S.

Advanced Clustering Techniques for Speech Signal Enhancement: A Review and Metanalysis of Fuzzy C-Means, K-Means, and Kernel Fuzzy C-Means Methods

arXiv.org Artificial IntelligenceSep-28-2024

Speech signal processing is a cornerstone of modern communication technologies, tasked with improving the clarity and comprehensibility of audio data in noisy environments. The primary challenge in this field is the effective separation and recognition of speech from background noise, crucial for applications ranging from voice-activated assistants to automated transcription services. The quality of speech recognition directly impacts user experience and accessibility in technology-driven communication. This review paper explores advanced clustering techniques, particularly focusing on the Kernel Fuzzy C-Means (KFCM) method, to address these challenges. Our findings indicate that KFCM, compared to traditional methods like K-Means (KM) and Fuzzy C-Means (FCM), provides superior performance in handling non-linear and non-stationary noise conditions in speech signals. The most notable outcome of this review is the adaptability of KFCM to various noisy environments, making it a robust choice for speech enhancement applications. Additionally, the paper identifies gaps in current methodologies, such as the need for more dynamic clustering algorithms that can adapt in real time to changing noise conditions without compromising speech recognition quality. Key contributions include a detailed comparative analysis of current clustering algorithms and suggestions for further integrating hybrid models that combine KFCM with neural networks to enhance speech recognition accuracy. Through this review, we advocate for a shift towards more sophisticated, adaptive clustering techniques that can significantly improve speech enhancement and pave the way for more resilient speech processing systems.

artificial intelligence, fuzzy c-means, machine learning, (15 more...)

2409.19448

Country:

Asia > Middle East > Iraq > Erbil Governorate > Erbil (0.04)
Asia > Middle East > Iraq > Kurdistan Region (0.04)
Europe > Middle East > Cyprus > Nicosia > Nicosia (0.04)
Asia > Middle East > Iran > Tehran Province > Tehran (0.04)

Genre:

Research Report > New Finding (1.00)
Overview (1.00)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Speech (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

arXiv.org Artificial IntelligenceMay-22-2024

Adaptive Fuzzy C-Means with Graph Embedding

Chen, Qiang, Yu, Weizhong, Nie, Feiping, Li, Xuelong

Fuzzy clustering algorithms can be roughly categorized into two main groups: Fuzzy C-Means (FCM) based methods and mixture model based methods. However, for almost all existing FCM based methods, how to automatically selecting proper membership degree hyper-parameter values remains a challenging and unsolved problem. Mixture model based methods, while circumventing the difficulty of manually adjusting membership degree hyper-parameters inherent in FCM based methods, often have a preference for specific distributions, such as the Gaussian distribution. In this paper, we propose a novel FCM based clustering model that is capable of automatically learning an appropriate membership degree hyper-parameter value and handling data with non-Gaussian clusters. Moreover, by removing the graph embedding regularization, the proposed FCM model can degenerate into the simplified generalized Gaussian mixture model. Therefore, the proposed FCM model can be also seen as the generalized Gaussian mixture model with graph embedding. Extensive experiments are conducted on both synthetic and real-world datasets to demonstrate the effectiveness of the proposed model.

algorithm, mixture model, objective function, (15 more...)

2405.13427

Country:

Asia > China > Shaanxi Province > Xi'an (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

#artificialintelligenceNov-25-2022, 15:30:06 GMT

Applications of C-Means Clustering part2 (Basic Machine Learning)

Abstract: Clustering is an effective technique in data mining to group a set of objects in terms of some attributes. However, most of existing K-Means based clustering algorithms cannot deal with outliers well and are difficult to efficiently solve the problem embedded the L0-norm constraint. To address the above issues and improve the performance of clustering significantly, we propose a novel clustering algorithm, named REFCMFS, which develops a L2,1-norm robust loss as the data-driven item and imposes a L0-norm constraint on the membership matrix to make the model more robust and sparse flexibly. In particular, REFCMFS designs a new way to simplify and solve the L0-norm constraint without any approximate transformation by absorbing 0 into the objective function through a ranking function. These improvements not only make REFCMFS efficiently obtain more promising performance but also provide a new tractable and skillful optimization method to solve the problem embedded the L0-norm constraint.

basic machine learning, c-means clustering part2, l0-norm constraint, (7 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Saini, Astha, Babu, Prabhu

Comments on "Iteratively Re-weighted Algorithm for Fuzzy c-Means"

arXiv.org Artificial IntelligenceSep-16-2022

In this comment, we present a simple alternate derivation to the IRW-FCM algorithm presented in "Iteratively Re-weighted Algorithm for Fuzzy c-Means" for Fuzzy c-Means problem. We show that the iterative steps derived for IRW-FCM algorithm are nothing but steps of the popular Majorization Minimization (MM) algorithm. The derivation presented in this note is much simpler and straightforward and, unlike the derivation of IRW-FCM, the derivation here does not involve introduction of any auxiliary variable. Moreover, by showing the steps of IRW-FCM as the MM algorithm, the inner loop of the IRW-FCM algorithm can be eliminated and the algorithm can be effectively run as a "single loop" algorithm. More precisely, the new MM-based derivation deduces that a single inner loop of IRW-FCM is sufficient to decrease the Fuzzy c-means objective function, which speeds up the IRW-FCM algorithm.

algorithm, artificial intelligence, machine learning, (15 more...)

2209.07715

Country:

Oceania > Australia > Queensland (0.04)
North America > United States > New York (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > India > NCT > New Delhi (0.04)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

López-Oriona, Ángel, D'Urso, Pierpaolo, Vilar, José Antonio, Lafuente-Rego, Borja

Quantile-based fuzzy C-means clustering of multivariate time series: Robust techniques

arXiv.org Machine LearningSep-22-2021

In particular, time series data have become ubiquitous in our days, arising frequently in a broad variety of fields including medicine, computer science, finance, environmental sciences, machine learning, marketing and neuroscience, among many others. Typically, time series involve a huge number of records, present dynamic behavior patterns which might change over time, and one frequently has to deal with realizations of different length. Due to this complex nature, standard techniques to perform data mining tasks as classification, clustering or anomaly detection often produce unsatisfactory results. Complexity is still greater by treating with high dimensional time series, where the interdependence structure and large dimensionality are serious obstacles to develop efficient procedures. Univariate time series (UTS) were the main focus of intensive research until recently, but multivariate time series (MTS) have received lately a great deal of attention due to the advance of technology and storage capabilities of everyday devices. Well-known examples of MTS are multi-lead ECG signals of patients or records containing several economic indicators of a given country over time, but many other examples can be easily obtained from different fields. Among time series data mining tasks, clustering is a central problem. In fact, identifying groups of similar series is basic for many applications in order to detect a few representative patterns, forecast future performances, quantify affinity, recognize dynamic changes and structural breaks... However, unlike traditional databases, similarity search in time series data is a complex issue that cannot be addressed with conventional methods.

scenario 1, scenario 2, time sery, (14 more...)

2109.11027

Country:

North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Spain > Galicia > A Coruña Province > A Coruña (0.04)
Europe > Italy (0.04)

Genre: Research Report > Experimental Study (0.45)

Industry:

Banking & Finance > Trading (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.54)
Health & Medicine > Therapeutic Area > Neurology (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.69)
Information Technology > Data Science > Data Mining > Anomaly Detection (0.68)

López-Oriona, Ángel, Vilar, José A., Pierpaolo-D'Urso, null

Quantile-based fuzzy clustering of multivariate time series in the frequency domain

arXiv.org Machine LearningSep-8-2021

A novel procedure to perform fuzzy clustering of multivariate time series generated from different dependence models is proposed. Different amounts of dissimilarity between the generating models or changes on the dynamic behaviours over time are some arguments justifying a fuzzy approach, where each series is associated to all the clusters with specific membership levels. Our procedure considers quantile-based cross-spectral features and consists of three stages: (i) each element is characterized by a vector of proper estimates of the quantile cross-spectral densities, (ii) principal component analysis is carried out to capture the main differences reducing the effects of the noise, and (iii) the squared Euclidean distance between the first retained principal components is used to perform clustering through the standard fuzzy C-means and fuzzy C-medoids algorithms. The performance of the proposed approach is evaluated in a broad simulation study where several types of generating processes are considered, including linear, nonlinear and dynamic conditional correlation models. Assessment is done in two different ways: by directly measuring the quality of the resulting fuzzy partition and by taking into account the ability of the technique to determine the overlapping nature of series located equidistant from well-defined clusters. The procedure is compared with the few alternatives suggested in the literature, substantially outperforming all of them whatever the underlying process and the evaluation scheme. Two specific applications involving air quality and financial databases illustrate the usefulness of our approach.

algorithm, procedure, time sery, (14 more...)

2109.03728

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(5 more...)

Genre:

Research Report > New Finding (0.45)
Research Report > Promising Solution (0.34)

Industry:

Information Technology (1.00)
Health & Medicine (1.00)
Banking & Finance > Trading (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Siddique, Md. Abu Bakr, Arif, Rezoana Bente, Khan, Mohammad Mahmudur Rahman, Ashrafi, Zahidun

Implementation of Fuzzy C-Means and Possibilistic C-Means Clustering Algorithms, Cluster Tendency Analysis and Cluster Validation

arXiv.org Machine LearningNov-10-2018

Abstract-- In this paper, several two-dimensional clustering scenarios are given. In those scenarios, soft partitioning clustering algorithms (Fuzzy C-means (FCM) and Possibilistic c-means (PCM)) are applied. Afterward, VAT is used to investigate the clustering tendency visually, and then in order of checking cluster validation, three types of indices (e.g., PC, DI, and DBI) were used. After observing the clustering algorithms, it was evident that each of them has its limitations; however, PCM is more robust to noise than FCM as in case of FCM a noise point has to be considered as a member of any of the cluster. The clustering [1-3] is a subfield of data mining technique and it is very effective to pick out useful information from dataset.

artificial intelligence, data mining, machine learning, (13 more...)

1809.08417

Country: North America > United States > Mississippi (0.14)

Genre: Research Report (0.64)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Oztürk, Aybükë, Lallich, Stéphane, Darmont, Jérôme

A Visual Quality Index for Fuzzy C-Means

arXiv.org Machine LearningJun-5-2018

Cluster analysis is widely used in the areas of machine learning and data mining. Fuzzy clustering is a particular method that considers that a data point can belong to more than one cluster. Fuzzy clustering helps obtain flexible clusters, as needed in such applications as text categorization. The performance of a clustering algorithm critically depends on the number of clusters, and estimating the optimal number of clusters is a challenging task. Quality indices help estimate the optimal number of clusters. However, there is no quality index that can obtain an accurate number of clusters for different datasets. Thence, in this paper, we propose a new cluster quality index associated with a visual, graph-based solution that helps choose the optimal number of clusters in fuzzy partitions.

artificial intelligence, machine learning, quality index, (15 more...)

doi: 10.1007/978-3-319-92007-8

1806.01552

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)